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This invention relates to the development of recombinant eukaryotic clon- 
ing and expre sion vectors based on unique regulatory elements isolated from 
autonomously replicating, stable episomal units from human tumor cell lines. 
More specifically, the unique regulatory elements relate to origins of replication, 
as well as conferring extrachromosomal stability and maintenance. This cloning 
and expression vector will accommodate genes that exceed the cosmid limit 
(greater than 50 kb) and permit their maintenance as autonomously replicating 
extrachromosomal elements in mammalian cells. Inclusion of telomeres and cen- 
tromeres would control the replication and segregation and therefore serve as an 
eventual vehicle for gene replacement therapy. This invention is therefore unique 
in providing for the expression and autonomous replication of large genes, 
maintained extrachromosomally, in a vector containing episomal regulatory ele- 
ments. 
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A EUKARYOTIC EPISOMAL DNA CLONING AND EXPRESSION VECTOR 

10 



This invention relates to the development of 
recombinant eukaryotic cloning and expression vectors 

15 based on unique regulatory elements isolated from 

autonomously replicating, stable episomal units isolated 
from human tumor cell lines. More specifically, the 
unique regulatory elements include origins of DNA 
replication, and DNA sequences that confer 

20 extrachromosomal stability and maintenance. These unique 

episomal regulatory elements permit large pieces of DNA to 
be expressed or cloned (greater than 50 kilobase pairs 
[kb] in size) . 

25 During the past decade, the underlying significance 

of recent advances in molecular biology has been the 
ability to clone and manipulate DNA from virtually any 
source by ligating restriction fragments into phage or 
plasmid vectors which are then replicated in JS. coll. 

30 

Since then, a crucial technological gap has developed 
in what is commonly called "conventional recombinant DNA 
technology." This technological gap stems from two 
developments. The first is the discovery that many 
35 eukaryotic genes are encoded by enormous lengths of DNA. 
Th second is an optimistic and enthusiastic goal of 
mapping and sequencing entire genomes, including the human 
g nom . 
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Because of the large size of DNA in many genes from 
higher organisms, this size limitation and restriction can 
be stifling. For example, bithorax locus in Dropsophlla, 
which plays an active role in the fly's segmentation 
5 pattern, encompasses approximately 320 Jcb (Karch, et al., 
Sell 43:81, 1985). Factor VIII gene in the human which 
encodes the blood-clotting factor deficient in 
hemophiliacs, spans at least 190 kb (Gitschier, et al., 
Mature (Lgndon), 312:326, 1984). The gene that is 

10 defective in Duchenne's muscular dystrophy is estimated to 
include more than a million base pairs (1000 kb) . a 
striking feature of this gene is the protein-coding 
portion may be encoded by as little as 15 kb of DNA 
(Monaco, et al. Nature ( London! . 302:575, 1983). Thus, 

15 there is a strong need for technological advances which 
permit the cloning and expression of very large genes. 

Also widening this technological gap is the increased 
interest in and enthusiasm for gene replacement therapy. 

20 Proposals to use genes to treat cancer and immune 

deficiencies have only recently been approved by the 
National Institutes of Health human gene therapy 
subcommittee and the Recombinant DNA Advisory Committee 
( Science , 249:974, August, 1990). These first studies 

25 focus on: 

(1) delivering tumor necrosis factor (TNF) directly 
to a tumor site in much larger doses by 
packaging the gene for TNF inside special 

30 lymphocytes that have a natural 

affinity for tumors; and 

(2) attempting actual gene replacement therapy in 
children with a rare, inherited and often lethal 

35 immune system disorder caused by adenosin 

deaminase d ficiency. A normal healthy 
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recombinantly produced ADA gen will be 
introduced into the white blood cell of an ADA 
deficient child and the cells are then returned 
to the patient (Id^ 975) . 

5 

To narrow this gap, molecular biologists are 
attempting to clone large pieces of exogenous DNA into 
compatible hosts by means of artificial vectors. How ver, 
standard recombinant DNA techniques, that involve the 

10 construction of small plasmid vectors that can be 

transfected into host cells and clonal ly propagated, ar 
limited in the amount of exogenous DNA that can be 
"squeezed" or inserted into these vectors. These size 
restrictions only permit about 50 kilobase pairs (kb) to 

15 be cloned into the vectors usually employed in cloning. 

More limitations exist when the discussion turns to 
the bacterial expression of mammalian proteins. The 
current technology for expressing mammalian proteins in 
20 bacteria is hampered with problems relating to post 

translational modifications and functional bioactivity. 

To date, cloning of large segments of exogenous DNA 
in the range of several hundred kilobase pairs has only 

25 been achieved by employing yeast. This was done by 

ligating exogenous DNA to vector sequences that allow 
their propagation as linear artificial chromosomes (Burk , 
et al, Science , 236:806, 1987). Although this technique 
is a significant step towards resolving this size 

30 restriction, cloning large segments of exogenous DNA into 
yeast is not without limitations. Questions and concerns 
about this technology pertain to (1) the stability of the 
recombinants, (2) whether clone banks ar representative 
of the starting material, (3) whether the desired protein 

35 is consistently expr ssed in extra chromosomal v ct rs, and 
(4) wheth r n rmal human transcripts are prop rly 
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processed in yeast, as well as, whether proper expression 
and post translational modification of the recombinant 
protein occurs in yeast. 

5 Therefore, with the yeast expression system and its 

limitations, there is still a very strong need to design 
and construct eukaryotic expression and cloning vectors 
possessing the capabilities of housing very large regions 
of DNA (greater than 50 Jcb) and of accurately processing 
10 and expressing of these large genes. With such a novel 
vector, large regions of DMA that span genes can then be 
cloned and whole proteins encoded by the genes can th n b 
expressed. 

15 ° ne mechanism by which a cell can accumulate large 

amounts of specific protein or RNA is by amplification f 
the respective gene. This amplification may be located on 
either expanded chromosomal regions (homogenous staining 
regions) or on extrachromosomal autonomously replicating 

20 elements (called double minute, double minute chromosomes 

or episomes) . 

Episomes have unique features; the most notable are 
that episomes autonomously replicate and are stably 
maintained extrachromosomally. The characteristics of 
episomes permits the continuous production of the 
respective amplified gene and the gene products it 
encodes. For example, an episome produced in hamster 
cells has been characterized to contain amplified amounts 
of a transfected CAD (CAD is an acronym for the 
multifunctional protein containing carbamylphosphate 
synthetase, aspartate transcarbamylase, and 
dihydroorotase) gene at high frequency (Carrol, et al.. 
Molecular and Cellular Biolggy, 7(5):i740, 1987). The 
35 amplified CAD gene is produced with ach division of each 
c 11. 



25 



30 
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Viral episomes have also been identified. It has 
been demonstrated that papilloma viral DNA replicates like 
a plasmid in mouse cells. Circular bovine papilloma virus 
(BPV) DNA can transform certain mouse cell lines to a 
5 malignant phenotype. In these transformed cell lines, the 
BPV DNA remains circular and extrachromosomal at ab ut 30 
- 100 copies per cell. This "plasmid" is being stably 
maintained in higher eukaryotes. Desired genes may be 
inserted into the BPV DNA and be maintained in the 
10 plasm id- like state and high levels of mRNA and protein 

corresponding to the desired gene can be produced. It has 
also been shown that Epstein-Barr virus vectors contain 
sequences that provide extrachromosomal stability of 
episomal DNA as veil as origins of replication. This 
15 viral vector has been used to identify human DNA sequences 
that permit autonomous replication in human cells (Krysan, 
et al. # Molecular and Cellular Biology. 9(3): 1026, 1989). 
But, it can be appreciated that there are many limitati ns 
when working with a virally produced protein. For 
20 example, in terms of producing proteins that may 

ultimately be used to replace defective human genes, viral 
episomes probably are not feasible because of potential 
Food and Drug Administration regulations, etc. Also the 
viral episome eventually integrates into chromosomal sites 
25 which then interferes with continued amplification and 
causes the expression of its resident genes to be 
extinguished. 

Thus, the limitations in terms of integration into 
30 chromos >mal sites and of potential hazards pertaining to 
the use of viral based vectors for amplification and 
expression apply to all eukaryotic viral episomes. 

It is th int nt of this invention to describ a 
35 eukary tic cl ning and an xpr ssion v ctor which will 
accommodate g n s that xc ed the cosmid limit (gr ater 
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than 50 kb) and permit their accumulation and maint nance 
as autonomously replicating extr achr omosoma 1 elements in 
mammalian cells. This invention is therefore unique by 
providing autonomous replication and expression of large 
5 genes in a vector containing episomal regulatory elements. 

This minimal cloning or expression vector will be 
further modified by the inclusion of regions of human 
chromosomes containing telomeres and centromeres. This 

10 would thus create a human artificial chromosome that w uld 
be subjected to the same control mechanisms (regarding 
regulation and chromosomal segregation) as normal 
chromosomes and therefore serve as a vehicle for gene 
replacement therapy. This modification of the 

15 extrachromosomal vector is therefore unique in that it 

will be a synthetic chromosome containing genes of choic , 
that will be expressed, and that will be maintained and 
regulated as if it were a normal chromosome. 

20 This cloning or expression vector may take on several 

forms. For example, two principal forms for employment 
are: (1) employed via extrachromosomal/episomal, 
autonomous replication and segregation which could even be 
amplified, and (2) employed via a human artificial 

25 chromosome under normal chromosomal control mechanisms. 

In general and overall scope, the present invention 
relates to the development of recombinant eukaryotic 
cloning and expression vectors based on unique regulatory 

30 elements isolated from autonomously replicating, stable 

episomal units isolated from human tumor cell lines. More 
particularly, these unique regulatory elements include 
origins of DNA replication, and DNA sequences that confer 
extrachr m somal stability and maintenance. Th se unique 

35 episomal r gulatory elements will permit large piec s f 
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DNA to be expr ss d or cloned (greater than 50 kilobases 
pairs in size) . 

This invention discloses procedures for producing two 
5 different types of vectors. One is a cloning vector and 

the other one is an expression vector. For the purpose of 
this invention, the phrase "cloning vector" refers to a 
DMA vector designed to be used to clone a desired gene. 
The techniques that are involved in cloning vary from 
10 vector to vector and from system to system, however, thes 
techniques in general are standard and known to those 
skilled in the art of recombinant DNA technology. 

Also, for the purpose of this invention, the phrase 
15 "expression vector" refers to a DNA vector capable of 

replication in selected mammalian host cells and 
expressing a desired protein. This protein may then be 
recovered from the cells by employing techniques known to 
those skilled in the art. 

20 

This cloning vector should include one or more 
functional origins of DNA replication to permit stable, 
autonomous replication. The phrase "origin of 
replication" is defined as a region that indicates the 
25 origin of replication. 

This cloning vector should include appropriate DNA 
sequences that confer extrachromosomal stability and 
maintenance. The sequences responsible for conferring 

30 extrachrom somal stability and persistence may be related 
to sequences responsible for nuclear matrix attachment 
sites, topoisomerase II reaction sites, and/or other 
regions r quir d for appr priate interactions with the 
nucl ar architecture. This xtrachromosomal stability and 

J 5 maintenance permits the intr duct ion of larg exogenous 
genes. 
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This cloning vector should also include DNA 
selectable marker sequences that can be used to confer 
drug resistance to a transf ected cell or DNA sequences 
that can correct a genetic mutation. This allows the 
5 cells that were transf ected with the vector to be sel cted 
for. The DNA selectable marker segment confers upon a 
cell transf ected with said vector, the ability to survive 
in the presence of a selected compound or selected group 
of compounds. The compound may be either 6418 or 

10 hygromycin B. Also, other selectable marker segments will 

contain DNA encoding an enzyme capable of functionally 
replacing a mutated enzyme so as to render the transf ected 
cell resistant to said selected compound or selected group 
of compounds. The enzyme may be selected from a group 

15 consisting of: thymidine kinase, xanthineguanine 

phosphor ibosyl transferase, adenine 
phosphoribosyltransf erase, adenosine deaminase and 
dihydrofolate reductase. 

20 This cloning vector should also include a multi-use 

multiple cloning site to facilitate recovery for genetic 
modification and analysis and insertion for reintroduction 
into cells for replication and expression. Multiple 
cloning cassette sequence cartridges are commercially 

25 available from several different companies (Promega, New 

England Biolabs, etc) • A typical cassette sequence would 
include restriction sites for 8-11 different enzymes 
(i.e. Eco RI, Sac 1, Sma 1, Ava I, Bam HI, Xba 1, Hinc II, 
Acc 1, Sal 1, Pst 1, Hind III, etc.) The availability of 

30 these cassette sequences are known to those skilled in the 
art. 

This cloning vector should also include a DNA segment 
encoding bacterial components n c ssary for propagation of 
35 said vector in bact ria. Bacterial compon nts that we 
ss ntial for propagation of the cloning vector in 
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bact ria are known to those skilled in this art. For 
example, two bacterial components essential for bacterial 
propagation are a replicon that is responsible for 
initiation of replication and antibiotic resistant markers 
5 (i.e. ampicillin, tetracycline, etc.) that permits growth 

in specific antibiotics. 

In addition to the above described five different 
components included in the unique cloning vector, a unique 
10 expression vector capable of expressing large pieces of 
DNA (40 - 400 kb) should also include, a promoter, a 
polyadenylation site and a splice site in spacial relation 
to allow efficient expression of a structural gene. 

15 The choice of promoters to be included in this vector 

will depend on the mammalian host cell employed. It is 
advantageous to employ a compatible promoter with regard 
to the cells that the desired protein will be express d 
in. The inventors prefer to employ promoters derived from 

20 the following genes (although other promoters would b 
satisfactory) : cytomegalovirus, SV-40, Rous sarcoma 
virus, thymidine kinase, bet a -act in, metallothionein, and 
the epidermal growth factor receptor gene isolated from a 
DiFi episome. 

25 

For the purpose of this invention, a polyadenylation 
site refers to the site at which a poly A tail (a str tch 
of 50 to 300 adenines) is added to the vector for 
efficient expression of a desired protein in a mammalian 
30 cell. Also, the phrase "splice site M refers to a 

bacterial processing site essential to remove introns 
incorporated into the bacterial plasmid. These components 
are essential for optimal expression of a d sir d protein. 

35 A furth r embodiment of this inv nti n is an 

artificial chromosom consisting of a DNA segment deriv d 
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from a non-viral episome, said segm nt containing an 
origin for DNA replication, a DNA segment derived from a 
non-viral episome, said segment containing a DNA sequence 
which confers upon said vector the ability to be stably 
5 maintained extrachromosomally in a cell transfected with 
said vector, a DNA segment containing a multiple cloning 
site, a DNA selectable marker segment conferring upon a 
cell transfected with said vector, the ability to survive 
in the presence of a selected compound or selected group 
10 of compounds, a DNA segment encoding bacterial components 
necessary for propagation of said vector in bacteria, a 
promoter, a polyadenylation site, a splice site, a DNA 
segment encoding a centromere and a DNA segment encoding a 
telomere. 

15 

Further in accordance for this invention is a 
substantially purified non-viral episome of human origin 
capable of stable extrachromosomal maintenance and of 
autonomous replication in a compatible mammalian cell 
20 line. 

Further in accordance for this invention is a 
substantially purified episomal DNA segment containing an 
origin of replication. This invention further includes a 

25 substantially purified* episomal DNA segment containing a 

DNA sequence, which confers upon a vector including said 
segment, the ability to be stably maintained 
extrachromosomally in a cell transfected with said vector. 
Another embodiment of this invention is a substantially 

30 purified, episomal DNA segment containing both an origin of 
replication and a DNA sequence, which confers upon a 
vector including said segment, the ability to be stably 
maintained extrachromosomally in a c 11 transfected with 
said v ctor. 



35 
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Another embodim nt of this invention is a DNA segm nt 
containing an origin for DNA replication is from an 
episome isolated from DiFi colorectal cell line. 

5 Another embodiment of this invention is a DNA 

sequence which confers upon said vector the ability t be 
stably maintained extrachromosomally in a cell transf ected 
with said vector is from an episome isolated from DiFi 
colorectal cell line. 

10 

Another embodiment of this invention is a DNA segment 
containing an origin for DNA replication and a DNA 
sequence which confers upon said vector the ability to be 
stably maintained extrachromosomally in a cell transfected 
15 with said vector is from an episome isolated from DiFi 
colorectal cell line. 



The various techniques which have been successfully 
applied to the cloning and expression of many genes in a 
20 variety of host systems, employing many different 

promoters and vectors, are known to those skilled in the 
art of recombinant DNA technology and could be applied to 
the embodiments described herein. 

25 For the purpose of this invention, the phrase 

"operatively spaced with respect to a desired gene 9 * is 
defined as the appropriate positional spacing required 
between the numerous cloning and expression vectors 
components described in this invention so as to allow each 

30 of the of components to achieve its desired function. 

These components are also directionally positioned 5' to 
3'. The appropriate spacing needed for efficient cloning 
or expression of a desired gene is determined for each 
individual vector. 



35 
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In terms of transf ecting eukaryotic cells with these 
unique cloning or expression vectors , the transf ection 
techniques are standard and known to those skilled in the 
art of recombinant DMA technology. In terms of 
5 transf ecting cells with the unique expression vector, this 
invention could also be applied for the production of 
stable cell lines which are, by definition, continuously 
producing the desired protein. The production of cell 
lines designed to continuously produce the desired protein 
10 has been described extensively in the literature, and is 
therefore known to those skilled in the art. 

CHARACTERISTICS OF THE DEPOSITED CELL LINE 

Cell line "DiFi" comprising cells obtained from th 
15 ascitic fluid of a colorectal tumor in a patient with 

Gardner's syndrome, is available from the ATCC, accession 
# CRL 10576. This cell line retains 50 copies or more of 
extra chromosomal episomes, each of which contains at least 
one complete copy of the epidermal growth factor receptor 
20 gene. 

Pig. 1. In situ hybridization of DiFi cells with EGFR- 

A portion of a metaphase from DiFi cells stained with 
Giemsa (A) , fluorescence visualization of in situ 
25 hybridization using biotinylated EGFR as probe and 

counter stained with propidium iodide (B) , and a black and 
white print of the fluorescence pattern of in situ 
hybridization (C) . 

30 Fig. 2. Electroohoretic mobilization of EG FR genes bv 

gamma irradiation. 
Autoradiogram of a Southern blot of a TAFE gel 
hybridized with 32P-labeled EGFR , Origin (o) is indicated 
at the top as is th dir ction of migration. Plug sampl s 
35 1-8 wer xposed to 0, 5, 10, 20, 40 80, 160, 320 Gray, 
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respectively. Hybridization membranes wer exposed to 
film for 24 hrs. 

Fig. 3. Effect of aamma irradiation on the 
5 electroph oretic mobilization EGFR in A43X, 

DjFi, a*nd Ifefra cells. 
Autoradiogram of a Southern blot of a TAFE gel 
hybridized with 32P-labeled EGFR . Origin and direction of 
migration is as in Fig. 2. A431, DiFi and HeLa cell DNA 
10 plugs were irradiated with A, OGy, B. 10 Gy, C. 40 Gy, D. 

160 Gy. Autoradiographic exposure was extended to 72 hr 
in order to enhance sensitivity for detecting any 
fragments that might have migrated from the A431 plugs. 

15 Fig. 4. CHEF analysis of EGFR in gamma irradiated DiFi. 

Plugs containing DiFi DNA were exposed to 31.4 Gy 
prior to electrophoresis. The analysis of control (c) and 
irradiated (R) samples was performed in duplicate. 
Approximate sizes of the observed fragments, in kbs, ar 

20 indicated to the right. 

INTRODUCTI ON TO THE DISCLOSED INVENTION 2 
AUTONOMOUSLY REPLICATING. STABLY MAINTAINED 
MICROCHR OMOSOMAL UNITS FROM HUMAN TUMOR CELL LINES 
25 In developing the invention, we elected to use stably 

maintained extrachromosomal units arising in some 
eukaryotic cell lines as starting material, because thes 
units contain all the genetic regions required for 
autonomous replication and extrachromosomal expression. 
30 Those steps are described below. 

In initial studies, the episomes are isolated from 
the rigin in a substantially purif i d form and the 
minimal essential elements f r pisomal r plicati n and 
35 transcription are localized and isolated. Those elements 
are th n ligated into a sel ct d DNA mol cul , together 
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with additional DNA segments, including, for example, 
selectable markers, multiple cloning site or sites, 
segments necessary for propagation in bacteria and/ or a 
promoter enhancer, splice site and polyadenylation site. 

5 

Replication of nuclear DNA in eukaryotes appears to 
be under precise and reproducible control, such that it is 
replicated only once in each s-phase, the DNA synthetic 
portion of each cell division cycle. In addition, each 
10 portion of the genome replicates at the same time in each 
S-phase, with expressing (transcribed) genes replicating 
early and non-expressing and/or structural DNA replicating 
late. 

15 The genomes of prokaryotes, viruses, and yeast 

contain DNA sequences called origins, that serve as sites 
for initiating cycles of DNA replication* By analogy, . 
such sites define replicating units, or replicons, in 
eukaryotic cells such as human cells. 

20 

An accepted working hypothesis is that the eukaryotic 
nucleus is organized into structural domains in which the 
nuclear matrix plays an essential role in organizing 
chromatin structure and in regulating function. Support 

25 for this hypothesis comes from studies demonstrating that 

DNA replication, DNA repair, transcription and 
post-transcriptional processing are associated with the 
nuclear matrix. Other studies have shown that DNA 
polymerase, RNA polymerase II, expressing and expressible 

30 genes, transcriptional enhancer sequences, topoisomerase 
II cleavage sites, topoisomerase II, and heterogeneous 
nuclear RNA (hnRNA) splicing complexes are highly enriched 
or specifically localized in the nuclear matrix. 

35 The fact that regulatory DNA sequ nces and the 

nuclear pr t ins with which they interact have not b en 
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identif ied is in part attributable to th unmanag able 
size of chromosomes and the complexity of the genetic 
elements they contain. However, stable cell lines are 
occasionally established in which regions of specific 
5 genes have been amplified (Stark, Cancer Surveys, 5:1-23, 
1986) and occasionally are segregated into autonomously 
replicating components. These exist in the nucleus as 
episomes (200 kb - 800 kb molecules) and/or light 
microscope-visible double minute chromosomes (dm ins, >1000 
10 kb) . 

This invention exploits these cell lines by isolating 
and investigating the structure and replication control of 
their extrachromosomal elements in order to identify DNA 

15 sequences required to ensure their autonomy for stable 

maintenance, replication and gene expression. This 
minimal essential structure should then provide the core 
structure with which to assemble a cloning and expression 
vector for genes exceeding sizes accommodated by cosmid 

20 vectors. 

Although the methodology described herein contains 
sufficient detail to enable one skilled in the art to 
practice the present invention, a commercially availbale 

25 technical manual entitled MOLECULAR CLONING (Maniatis, et. 
al., Cold Spring Harbor Laboratory, Cold Spring Harbor, 
New York) may provide some additional details useful to 
assist practice of some aspects of this invention. 
Accordingly, this manual is incorporated herein by 

30 reference. 

The following examples are designed to illustrate 
c rtain aspects of th pres nt invention. Howev r, 
th y should not be construed as limiting th claims 
35 thereof. 
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EXTRACHROMOSOMAL AMPLIFICATION 
OF THE EPIDERMAL GROWTH FACTOR RECEPTOR GENE 
IN A HUMAN COLON CARCINOMA CELL LINE 

5 This example describes the isolation and 

characterization of an autonomously replicating episomal 
unit derived from a human colorectal carcinoma cell, 
established from ascites from a patient with Gardner's 
syndrome, designated w Difi M (Bowman, et al., In: 

10 Hereditary Colorectal Cancer , J. Utsunomiya and H. Lynch 
(Eds.)/ Springer-Verlag, In Press, 1990) • The invention 
is not limited to the w Difi M episome, however, for the 
basic procedures provided by the present disclosure should 
enable those of skill in the art to develop vectors from 

15 the epi somes of other cells* 

DiFi cells were (1) successfully established in 
tissue culture, (2) shown to contain amplified EGFR genes 
and mRNA, and (3) characterized cytologically to be near 
20 tetraploid with the presence of double minutes (dmin; 
Bowman et al. In Hereditary Colorectal Cancer, J. 
Utsunomiya and H. Lynch (eds) , SpringVerlag, In Press, 
1990) . 

25 Xi. CELL LINES EMPLOY ED AND CELL CULTURE CONDITIONS 

A431 (obtained from Gary Gallick, M. D. Anderson 
Cancer Center) , HeLa and DiFi cells were maintained in 
Dulbecco's medium supplemented with 5% fetal and 5% 
newborn calf serum. SW480 cells, a colon tumor cell lin 

30 (established by Leibovitz, 1976 and obtained from Mark 

Blick, M. D. Anderson Cancer Center) were grown and 
maintained in L-15 medium containing L-glutamine and 
supplemented with 10% fetal calf serum, insulin (5ug/ml) 
and glutathion (I6ug/ml) . 

35 
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A - Characteristic of a human col rectal cancer cell 
line fDlFit 

"DiFi" colorectal carcinoma cell line represents one 
of the first cell lines to be established and 
5 characterized from a patient with Gardner syndrome. 

Malignant ascitic fluid cells were isolated from a 46 
year old female rectal cancer patient with Gardner 
syndrome and initiated to grow in culture. The cells have 

10 been maintained in culture for over three years. Hoechst 
stain analysis for mycoplasma was negative. Subcutaneous 
injection of DiFi cells into athymic mice demonstrated 
tumor production in 50% of the mice. The cells have a 
tetraploid karyotype, and possess an isozyme pattern 

15 characteristic of colorectal cancer cell lines. 

XX^ OF EGFR DNA IK m'F< CEI.T,S BY TW « TT tt 

HYBRIDTZATTOM 

The following studies demonstrated the episomal 
20 location of the amplified EGFR gene. 

Slides containing metaphase cells from either DiFi or 
SW480 cells were prepared and stored at room temperature. 
Prior to jn sifru hybridization with a biotinylated EGFR 

25 probe, the slides were stained (six minutes in 5% Giemsa 

prepared in phosphate buffer pH 6.8) and photographed. In 
Sifca hybridization involved treating the photographed 
slides with RNAse, DNA denaturation and dehydration 
solutions, overnight incubation in a hybridization mix 

30 containing a biotinylated EGFR probe, and tagging the 

regions of EGJS hybridization with f luorescein-avidin and 
biotinylated goat anti-avidin. This procedure resulted in 
a three lay rs of fluor scein-avidin, and visualization by 
fluorescence microscopy (Pinkel et al., Proc. Natl. A Ga d. 

35 Sci. USA. 83:2934, 1986). 
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The EGFR CDNA probe, HER-A64-3 (Ullrich et al. , 
Nature 309:418-425, 1984) , was labeled by nick translation 
with biotin-7-dATP according to the instructions provided 
by BRI*. Hybridization mix (25 ul) containing 10% PEG 6000 
5 and 5 ng of probe was used on each slide. Following in 
situ hybridization and fluorescence labeling procedures, 
slides were rinsed and counterstained in propidium iodide 
(2 ug/ml in HjO) for two minutes, rinsed with H 2 0, and 
carefully blotted dry. Two drops of antifade solution 
10 (Johnson and Aroujo, rr_ Imfaurjol^ Methods 43:349-350, 1981) 

were added to each slide before covering with a coverslip. 
Metaphase chromosomes were photographed under epi-UV- 
illumination on Kodak Ektachrome 160 film using the Zeiss 
filter 25 combination 48 77 09. 

15 

Giemsa-stained metaphase chromosomes from DiFi cells 
revealed a background of extrachromosomal particles at the 
limit of optical resolution (Fig. 1A) . Occasionally, they 
were paired in the form of small drains. To determine 

20 whether these structures contained copies of the EGFR 
gene, the biotinylated A64-3 cDNA EGFR probe was 
hybridized to these metaphase cells. SW480 cells served 
as a negative control because their dmins are amplif i d 
for MYC rather than EGFR (Untawale, Masters Thesis on File 

25 at the Graduate School* of Biomedical Sciences, University 
of Texas Health Science Center, Houston, Texas, 1987; 
Untawale and Blick, Anticancer Res. 8:1-8, 1988). 
Thirty-five SW480 metaphase cells were examined for 
hybridization with biotinylated A64-3 cDNA EGFR probe. No 

30 hybridization was observed to any metaphase chromosome or 
extrachromosomal entity (data not shown) . The same 
analysis was performed with DiFi metaphase spreads and 
thirty-three out of sixty-six demonstrated strong 
hybridization to extrachromosomal regions. No conclusions 

35 could b drawn from th remaining thirty-thre metaphase 
cells due to weak hybridization or high background. 



WO 92/07080 



PCT/US91/07690 



-19- 

Figur 1 presents in situ hybridization of DiFi 
metaphase cells with EGFR probe. A portion of a metaphase 
spread from DiFi cells was stained with Giemsa (1A) . 
Fluorescence visualization of in situ hybridization using 
5 biotinylated EGFR as a probe and counter stained with 
propidium iodide is shown in IB, and a black and whit 
print of the fluorescence pattern of in situ hybridization 
is shown in 1C. 

10 In the Geimsa stained metaphase (1A) the chromosomes 

are intensely stained in contrast to the diffuse staining 
of extrachromosomal material in the background* The 
extrachromosomal background appears to be dmin, which vary 
in their size and visibility. Hybridization of the 

15 biotinylated EGFR probe (yellow fluorescence) was limited 
to extrachromosomal regions containing dmin, rather than 
chromosomal DNA (IB) • In ord^r to emphasize the 
extrachromosomal hybridization the photograph was printed 
in black and white (1C) . In Figure 1C, the 
20 extrachromosomal labeling was visualized more clearly 

since the fluorescein fluorescence is more intense in dmin 
than isothe propidium fluorescence from the chromosomes. 

Therefore, in situ hybridization of the biotinylated 
25 EGFR probe in the DiFi cell line demonstrated localized 

hybridization predominantly in extrachromosomal regions 
rather than to chromosomal DNA. 

The in situ hybridization analysis presented in 
30 Figure IB and 1C consistently demonstrated specific 

biotinylated EGFR localized in the extrachromosomal 
background. This specific localization is most likely 
associated with pisomes many of which are to small in 
size and disorganiz d in structur to b visualiz d as 
35 dmins in standard cytogen tic spreads. 



WO 92/07080 PCT/US91/07690 

-20- 

TTT. PREPARA TION AND IRRADIATION OF DNA 

After confirming that the EGFR amplification observed 
in the DiFi cells was mediated by a stable episomal 
fraction, we next sought to isolate that fraction from the 
5 cells using the procedures described below. 

Cells were embedded, lysed and deproteinized in 
agarose blocks in order to minimize shear damage to the 
DNA (Smith et al. , In Methods in Enzvmoloav. M. Gottesman 

10 (Ed.), Academic Press, San Deigo, Vol 151, p. 461., 1987) • 

Agarose blocks, with each sample containing approximately 
3 ug of DNA, were cut to fit gel slots. Samples were 
suspended in 1 ml of TAFE buffer (10 mM Tr is-acetate , pH 
8.0; 0.5 mM EDTA) in 12 x 75 mm polystyrene culture tubes 

15 and exposed to m Cs gamma rays at a dose-rate of 45 

Gray/min to linearize the DNA for pulse field 
electrophoresis (van der Blick et al., NAR 16:4841-4851, 
1988; Beverly, NAB 16:925-939, 1988; Ruiz et al., yiol. 
Cell. Biol. 98:109-115, 1989). The inventors exposed 

20 agarose plugs containing unsheared DiFi cellular DNA to 

varying doses of gamma radiation prior to analysis by 
pulse-field gel electrophoresis. Appropriate levels of 
exposure were estimated based on an expected yield of 1.1 
x 10** double-strand breaks/Gy/bp (calculated from Krisch 

25 et al., Rad. Res. 101:356-372, 1985). 

iv, pulsed— fie ld SEL electrophoresis was employed to size 

DNA 

Following irradiation, the samples were loaded into 
30 1% agarose gels and subjected to transverse alternating 
field electrophoresis (TAFE) using TAFE buffer in a 
GeneLine system (Beckman Instruments) • Agarose plugs 
containing y ast chromosomes or concatem rs of lambda 
phage DNA w re included on gels as size standards. 
35 Initial curr nt was h Id constant at 170 ma for 30 min, 

reorienting the direction of the electrical field every 4 
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sec, followed by a constant current of 150 mA for 18 hr 
with a field reorientation interval of 60 sec. 

Some experiments employed the clamped homogeneous 
5 electrical field (CHEF) protocol for pulsed-field gel 
electrophoresis (Chu et al., Science 232:65-68, 1986). 
Here, electrophoresis was performed in 0.5x TBE buff' r (45 
mM boric acid, 45 mH Tris and 2 mM EDTA, pH 8.3) at a 
constant current of 70 volts reoriented every 15 min for a 
10 total of 3 days. 

SOUTHERN TRANSFER A ND HYBRIDIZATION 
Upon completion of electrophoresis, staining (0.5 
ug/ml ethidium bromide) , and photography, gels were 

15 irradiated for 5 min with 254 nm UVL (Gelman Instrument 

Co., Model 51438). This was followed by gentle shaking in 
0.25 M HC1 for 5 min for depurination, rinsing in 
deionized water, soaking in 0.4 M NaOH for 1 hr for 
hydrolysis of depurinated bases, rinsing in deionized 

20 water, and soaking in 0.2 M NaOH, 0.6 H NaCl for 1 hr for 
denaturation. The DNA was transferred to a Zetabind nyl n 
membrane (AMF Cuno, Inc.) in the denaturing solution for 
15-20 hrs. The filter was then treated with two 15 min 
washes in a neutralizing solution (0.5 M Tris-HCl, pH 7.5; 

25 1.5 M NaCl) and dried in a vacuum oven at 80°C for 1 hr. 
Labeling of probe, hybridization to filters and 
autoradiography for visualization of fragments were 
performed as previously described (Amasino, Anal . B iochem . 
152:304-307, 1986; Liu et al.. Science 246:813-815, 1989). 

30 

Figure 2, an autoradiogram of a Southern blot of a 
TAFE gel probed with 3 2P- labeled EGFR. demonstrates 
lectrophoretic mobilization of EGFR genes by gamma 
irradiation. Th rigin (o) as w 11 as the dir ction of 
35 migration is indicat d at th top of the figure. Plug 

samples 1-8 were expos d to 0, 5, 10, 20, 40 80, 160, 320 
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15 



30 



35 



Gy, respectively. Hybridization m mbranes wer exposed to 
film for 24 hrs. 

Southern analysis of a gel hybridized with an EGFR 
5 probe demonstrated the dose dependent migration of two 

different sized fragments containing EGFR sequences (Pig. 
2) . The pattern of migration of total DNA was observed by 
staining gels with ethidium bromide (data not shown) . 
Dose-dependent increases were observed in the amount of 
10 random sized DNA fragments migrating between the sample 

well and the front of each lane. Increased amounts of DNA 
also accumulated in the zone representing molecules of 
2500 kb or larger under the electrophoresis conditions 
employed. The EGJB-containing fragments migrated at a 
position consistent with approximately 650 kb and 1300 kb 
representing faster and slower migrating forms, 
respectively. The origin is indicated by »o.» 

Figure 3, an autoradiogram of a Southern blot of a 
TATE gel probed with 32P-labeled mm. demonstrates the 
effect gamma irradiation has on the electrophoretic 
patterns of migration of EGFR sequences in A431, DiFi, and 
HeLa cells. The origin and direction of migration are as 
in Fig. 2. DNA plugs from A431, DiFi and HeLa cells were 
irradiated with increasing amounts of radiation: Lane 
(A) : OGy; Lane (B) : 10 Gy; Lane (c) : 40 Gy; Lane (D) : 160 
Gy. The autoradiographic exposure was extended to 72 hr 
in order to enhance sensitivity for detecting any 
fragments that might have migrated form the A431 plugs. 

Dose dependent increases were observed in the amounts 
of randomly broken DNA fragments migrating from sample 
wells into ach lane. As is observed, EGFR amplification 
is much high r in DiFi DNA and A431 DNA when compared to 
HeLa DNA. More importantly, sampl plug irradiation did 



20 



25 
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not release discrete sizes of HeLa and A431 EGFR sequ nc s 
were (confirmed by exposing autoradiograms for 7 days, 
data not shown) . However, mobilization of both the 650 Jcb 
band and 1300 kb band DiFi EGFR fragments were readily 
5 detected. To summarize, EGFR sequences in both HeLa and 
A431 DNA appear to be chromosomal ly localized. In 
contrast, £SER sequences in DiFi DNA appear to be 
episomally (extrachromosomally) localized and may be 
substantially purified by the procedure described here. 



10 



20 



Figure 4 presents CHEF analysis of EGFR from gamma 
irradiated DiFi DNA. Plugs containing DiFi DNA were 
exposed to 31.4 Gy prior to electrophoresis. The analysis 
of control (c) and irradiated (R) samples was performed in 
15 duplicate. Approximate sizes of the observed fragments, 
in Jcbs, are indicated to the right. Irradiating DiFi 
plugs and conducting CHEF electrophoresis under conditi ns 
that resolve larger DNA fragments revealed the presence of 
a weakly hybridizing band of approximately 2,000 kb, in 
addition to the 650 kb and 1300 kb fragments (Fig. 4). In 
unirradiated control lanes (C) a small portion of 
E£FR-containing molecules were observed to have migrated 
into the gels. This observation was previously attribut d 
to degradation of cellular DNA during the preparation f 
25 agarose plugs (van der Blick, et al., fi&B 16:4841-4851, 
1988) . 

VI. SUMMARY 

In Situ hybridization, using a biotinylated cDNA 
30 probe for the epidermal growth factor receptor ( EGFR ) 

gene, demonstrated that amplified ££££ in colon tumor c 11 
lines, DiFi, is localized to many small double minute 
chromosomes of varying siz and visibility. Analysis of 
the lectrophor tic mobility of gammairradiated DNA from 
35 DiFi by puis d-f i Id gel electrophor sis and Southern blot 
hybridization using EGFR prob , indicated that the 
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amplif ied EGFR in DiFi exists in xtrachromosomal , 
covalently-closed circular episomes, probably equivalent: 
-to dmin. Two major and one minor species were observed 
having estimated sizes of 650 kb, 1300 kb, and 2000 kb. 
5 The DiFi cell line appears to represent a unique case f 
extrachromosomal EGFR gene amplification in human cells. 
DiFi represents the first example of a stably maintained 
episome in which EGFR is amplified. 

10 EXAMPLE 2 

CONSTRUCTING A MAMMALIAN EPISOMAL 
EXPRESSION OR CLONING VECTOR 
The identification, characterization and isolation of 
DNA regulatory regions within the episomes that function 

15 a) as origins of autonomous DNA replication, and b) 

function as stabilizing regions for extrachromosomal 
maintenance will permit the construction of cloning and 
expression vectors that replicate and function as 
extrachromosomal vectors. The following is meant to s rve 

20 as one example of identifying and isolating such 

regulatory factors from the episomal unit maintained in 
human tumor cell. In some instances, reference is made to 
working with the episomal unit from DiFi cells; DiFi is 
used here only as an example. 

25 

I. IDENTIFICATION AND ISOLATION OF REGULATORY FT.KMKNTS 
IN STABLE EPISOMAL UNITS ESTABLISHED IN HUMAN TUMOR 
CEL^ I,I?*ES 

A. Episome Isolation 
30 In order to identify and isolate replication 

regulatory elements from an episome, the episome its If 
must first be isolated. 



35 



The ideal starting point is a preparation that is 
highly nrich d for the episom s f interest. A highly 
enrich d sourc of ISEB-containing episomes is the human 
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DiFi cell line. DNA will b isolat d from this enriched 
preparation and most of the DiPi genomic DMA can be 
eliminated from this preparation by employing an alkaline 
lysis modification (Griffin, et al., J. virol. , 40:11-19, 
5 1981) . An essentially pure preparation of DiPi episomes 
can then be obtained by preparative electrophoresis on 
agarose gels that permits the mobilization of covalent 
circular DNA molecules (Carroll et al., Mol. Cell. ni»T r 
7:1740-1740 (1987)). These molecules can then be 
10 recovered from the gels by procedures that dissolve or 

digest (agarose) the agarose and permit the episomal DNA 
to be purified directly from the digest. 

Determine a Restriction Map of the Bnisn^^ 

15 Genome. 

A restriction enzyme analysis will be performed after 
the episome is isolated. For example, most of the DiFi 
episome can be separated into two pieces by exploiting the 
limited number of sites susceptible to restriction enzym s 

20 Mlul (2 sites) and NotI (2 sites) . Mlul cuts at two 
closely spaced sites whereas NotI cuts at two widely 
distant sites. Table l presents macrorestriction fragm nt 
sizes of DiFi episomes digested with Mlul and NotI 
restriction enzyme. 

25 

TABLE 1 

MACRORESTRICTION FRAGMENT SIZES OF EPISOMES 
DIGESTED WITH Mlul AND NotI RESTRICTION ENZYME 
Restriction Enzyme Fragment six** 

30 Mull - 50 kb, - 600 kb 

NotI - 270* kb, ~ 380** kb 

Mlul + NotI -50 kb, -220 kb, -380 kb 

* The 3' end of this fragment contains, the 5' 
35 untranslat d regi n, exon I, and the 5' nd of 

intron I of th EGFR gen . 
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** The 5' end of this fragment contains the 

remainder of the EGFR gene from intron I through 
the 3' terminus of the gene. 

5 

Digestion of total DiFi DNA with Mlul and 
electrophoresis on agarose gels using a pulsed field gel 
electrophoresis format (Chu et al. r Science 232:65-68, 
1986) permits isolation of the region in the gel 

10 containing DNA fragments of -600 kb. Digesting the 

agarose plugs with NotI further reduces the size 
distribution pertaining to genomic DNA and also cleaves 
the DiFi episome into its expected fragments. This 
protocol yields identifiable and highly enriched DiFi 

15 episomal fragments on a background of digested genomic 

DNA. The individual episomal NotI fragments (-220 and 
-380 kb) are concentrated by electrophoresis in a second 
dimension, and then recovered from the gel by procedures 
that dissolve or digest agarose, thereby allowing 

20 purification of the desired DNA fragments for cloning. 

C. Construction of D iFi Episome Recombinant DNA 
Libraries 

1. Lambda Libraries 

25 Lambda libraries were constructed that represented 2 

to 10 kb portions of the DiFi episome by utilizing 
partially restriction enzyme digested episomes or NotI 
fragments and the Lambda-Zap phagemid vector (Short, 
Fernandez, Sorge, and Huse, Nuc. Acids Res. 16:7583-7600, 

30 1988) . 

2. Cosmid Libraries 

Cosmid libraries are constructed with BamHI partial 
digests of isolated episomes or NotI DiFi episomal 
35 fragments by utilizing the sCosl vector (Evans, et al, 

Gene , 79:9-20, 1989). Thes cosmid libraries r pr sent 
portions of the DiFi pisom in approximately 40 kb 
blocks . 
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3. PI Libraries 

Recombinant DNA libraries containing portions of the 
DiFi episome are constructed by utilizing the PI 
bacteriophage based cloning vector (Sternberg, Proc. Nat, 
5 Acad. Scl. USA . 87:103-107, 1990). This PI library 

contains DiFi episomal portions representing two siz 
ranges: less than 30 Jcb and approximately 85 - 110 kb. 

4. Plasmid libraries 

10 Recombinant DNA libraries containing portions of the 

DiFi episome are constructed utilizing an E. coli F sex 
factor based cloning vector (Leonardo and Sedivy, 
Biotechnology , 8:841, 1990). This F plasmid library 
contains DiFi episomal portions up to at least 150 kb. It 

15 should be understood that other plasmid libraries can be 

constructed using one of several available plasmid vectors 
(i.e. pKS, pT7\T3a-18, etc.). These vectors are known to 
those skilled in this art. 

20 Identification of Fmictjopal Regions Wjtfri*l 

ppjgppeg Regulating PFft Replication 
In order to identify distinct episomal regions for 
replication, various portions of recombinant DiFi episomal 
DNA libraries (from the above section) are first 

25 introduced into appropriate mammalian host cells (Krysan, 

et. al., Mol. Cell. Biol. , 9(3):1026, 1989). Autonomously 
replicating segments from the DiFi episome are first 
identified and the isolated segment is incorporated into a 
cloning or expression vector. Any transfection method may 

30 be employed for introducing portions of the recombinant 

library into mammalian host cells (i.e. calcium phosphate 
transfection (Chen and Okayama, Molec. Cell. Biol. 
7:2745-2752 (1987)); electroporation (Chu t al., Nucl 
Acids Res, 15:1311-1326 (1987)). 
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For example, pools of approximately 10 different 
plasmid vector clones from the DiPi Cosl library are 
introduced into for example, HSF56 human primary 
fibroblast cells via calcium phosphate transfection or 
5 electroporation. Each Cosl vector clone contains a 

selectable marker that confers drug resistance to 6418, 
for example. Retention and replication of trans fected 
clones are identified by growing the transf ected 
population of HSF56 cells in the presence of G418, a 
10 compound which specifically selects for cells that are 
neomycin resistant. The cells are placed under 6418 
selection 2 days after transfection, and 6418 resistant 
populations are grown for at least two months by 
maintaining the resistant clones appropriate subculturing 
15 techniques known to those skilled in the art of tissue 

culture. 

Neomycin resistant clones that persist for several 
cell divisions therefore contain a DiFi Cosl vector clon 
that is replicating. A persistent neomycin resistant cell 
clone is recovered and low molecular weight DNA (less than 
120 kb) is isolated by the HIRT extraction method (Hirt, 
J. Mol. Biol f , 26:265-369, 1967). The DMA isolated from 
this neomycin resistant cell clone will be subcloned into 
plasmid vectors that accommodate smaller inserts, such as 
the pKS vector or the pT7/T3a-18 vector, which, 
preferably, will also contain a selectable marker, such as 
a gene encoding beta lactamase, which confers resistance 
to ampicillin. 

The result of this will be another plasmid library 
which includes specific regions, one or more of which 
contain an origin for DNA replication. The clones from 
this new library will n xt b introduced into bacteria and 
bact rial col nies resistant to, for example, ampicillin, 
will be isolated. In a preferred embodiment, the host is 
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an E. coli c 11 of a typ which is compatibl with the 
vector type. 

To determine if the DNA in the bacterial colon! s 
5 contain an origin of DNA replication, the DNA from th 
ampicillin resistant bacterial colonies will be 
transfected into a mammalian cell line. The DNA (isolated 
with the HIRT extraction method) from the transfected 
mammalian cells will be analyzed by the Dpn I digestion 

10 (Krysan et al., Molec. Cell, Biol, 9:1026-1033, 1989 which 

is incorporated herein by reference) . DNA exhibiting the 
bacterial methylation pattern is cleavable by Dpn I 
restriction enzyme while DNA with mammalian methylation 
pattern is not. Thus, DNA that is not digested by Dpn I 

15 has replicated in the mammalian cell. The origins for DNA 
replication will then be identified within the inserts in 
autonomously replicating clones. The origin can then b 
removed from the vector, and inserted into the recombinant 
cloning vector. Vectors that include regions, from the 
20 DiFi episome are designated pDFE ori+ and will serve as 

the recipients for inclusion of other regions of the DiFi 
episome conferring episome maintenance. 

Identification of Functional Regions within 
25 Episomes Regulating Extra chromosomal Maintenance 

Identifying those individual clones that contain a 
region conferring extrachromosomal stability is determined 
by long term culturing (longer than two months) in the 
presence of a selection drug. The clones that surviv the 
30 continuous exposure to the selection drug must contain a 
region that confers extrachromosomal stability. 

Bri fly, clones that persist during s v ral cell 
division cycles will also be evaluated to id ntify regions 
35 within pisomal DNA that conf r stability for maintenanc 
of extrachromosomal molecul s. The proc dur by which 
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is lation of this region is essentially the same one as 
described for identifying the replication region, except 
that vectors containing DiFi episomal origins of 
replication will be used to clone other restriction 
5 fragments from the DiFi episome. Once the first round of 
drug resistant cell colonies are identified, the episomal 
DNA may be isolated and introduced into bacteria and 
bacterial colonies resistant to, for example, ampicillin, 
will be isolated. 

10 

To determine if the DNA in the bacterial colonies 
contain a region conferring extrachromosomal stability, 
the DNA from the ampicillin resistant bacterial colonies 
will be transfected into a mammalian cell line. The DNA 
15 (isolated with the HIRT extraction method) from the 

transfected mammalian cells will be analyzed for fragm nt 
size and, depending on that size, another cycle may be 
initiated to further reduce the size of the piece of DNA 
that confers the extrachromosomal stability. 

20 

In addition to evidence for extrachromosomal 
stability that is provided by the vector's provision of 
drug resistance, the intranuclear localization of vector 
episomes will be evaluated. Vector-containing cells are 

25 treated with the non-ionic detergent Triton X-100 and 2M 
NaCl. This treatment produces salt extracted residual 
nuclei, called nucleoids, which cam be centrifuged into a 
pellet at low speeds. Vectors associated with the nuclear 
matrix will pellet with the nucleoids; if they do not 

30 pellet with the nucleoids they will remain in the 
extracts' super nate. 

Both the region for the origin of DNA replication and 
for extrachromosomal maintenanc will be d fined as the 
35 core structures of both the cloning and expression vector 
and will be d signated PDFE ori + mat 41 *. 
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Construction of Optimal Eukarvotic Cloning 
Vector to Accommodate 40 kb - 400 kb Pieces of 
DNA. 

Once the core structure is determined, construction 
5 of the optimal eukaryotic cloning or expression vector 
will be completed* This is accomplished by adding the 
following three features to the core structure (these will 
be discussed below) : 

10 a. a DNA or genomic DNA region encoding at least 

one selectable marker; 

b. a DNA or genomic DNA region encoding a multiple 
cloning site; and 

15 

c. a DNA or genomic DNA region encoding bacterial 
components necessary for propagation of the 
vector in bacteria. 

Selectable markers, for mammalian cells, confer 
resistance to a specific selection agent once DNA 
conferring the resistance is transf ected into individual 
cells possessing a genetic inheritance pattern appropriat 
for the selectable marker being used in the vector. There 
are a variety of different dominant and recessive 
selection agents known to those skilled in the art. Any 
one of the following genes and agents should be effective 
in terms of employing a selection system: 

30 □ G418 resistance is selected by exposure to 

medium containing 100 to 800 ug/ml 6418. 6418 
selects for cells deficient in the enzyme 
aminoglyc sid phosphotransf rase and are 
referred to as neomycin r sistant cells. 

35 (South m and Berg, J a tfoJLec, Appl- <?ey» f , 
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1:327-341, 1982; Colbere-Garapin et al., J. 
MQlec. Biol.. 150:1, 1981). 

HAT resistance for forward selection 
(converting a thymidine kinase minus cell to a 
thymidine kinase positive cell) is selected with 
complete medium supplemented with 100 uM 
hypoxanthine, 0.4 uM aminopterin, 16 uM 
thymidine and 3 uM glycine. HAT medium selects 
for variants defective in either 
hypoxanthine-guanine phosphoribosyl-transf erase 
or thymidine kinase (Littlef ield, Proc. Natl. 
Acad. Sci. USA , 50:568, 1963; Littlef ield, 
Science . 145:709-710, 1964). 

Hygromycin B resistance is selected by 
exposure to complete medium supplemented with 10 
- 400 ug/ml hygromycin B. Hygromycin B selects 
for variants defective in the enzyme 
hygromycin-B-phosphotransf erase (Gritz and 
Davies, Gene . 25:179-188, 1983; Santerre, t 
al.. Gene . 30:147, 1984; Palmer, et.al., Proc. 
yatl. Acad y Sci, USA, 84:1055-1059, 1987). 

Adenine phosphoribosyltransf erase (APRT) 
positive variants sure selected by exposure to 
medium supplemented with 25 uM alanosine, 50 uH 
azaserine and 100 uH adenine (Lowy, et. al., 
Cell, 22:817, 1980; Adair, et. al., pyoc, N^tl. 
frcaicU Sci. PPA, 86:4574-4578, 1989). 

Xanthine - Guanine 
Phosphoribosyltransf erase (XGPRT) positiy 
variants ar selected with complete m dium 
supplement d with dialyzed fetal calf serum, 250 
ug/ml xanthine, 15 ug/ml hypoxanthin , 10 ug/ml 
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thymidin , 2 ug/ml aminopt rin, 25 ug/ml 
mycophenolic acid, and 150 ug/ml L-glutamine 
(Mulligan and Berg, Proc . Natl . Acad . Sci . USA , 
78:2072-2076, 1981). 

□ Methotrexate resistance is selected by 
exposure to complete medium supplemented with 
0.01 uM - 300 uM methotrexate and dialyzed fetal 
calf serum. Methotrexate selects for cells 
expressing high levels of dihydrofolate 
reductase (O'Hare, et al. , Proc. Natl, Acad. 
Sci. USA . 78:1527, 1981; Simon sen and Levinson, 
Pi-q. Natl. Acad, Sci. USA, 80:2495-2499, 1983). 

□ Deoxycoformycin resistant cells are 
selected by exposure to complete medium 
supplemented with 10 ug/ml thymidine, 15 ug/ml 
hypoxanthine, 4 uM 9-B-D- xylofuranosyl adenine 
(XylA) , and 0.01 - 0.03 uM 2 ' -deoxycoformycin 
(dCF) • This selection selects for mutants 
expressing adenosine deaminase (ADA; Kaufman, 
et. al., Proc. Natl. Acad. Sci. USA, 
83:3136-3140, 1986). 

25 For added ease in handling and manipulating, this 

optimum eukaryotic cloning vector could include a DNA 
region comprising a multiple cloning cassette sequence 
containing infrequent cutting by restriction enzymes to 
facilitate the insertion of a desired gene. Multiple 

30 cloning cassette sequence cartridges are commercially 
available from several different companies (Stratagen , 
Pr omega, New England Biolabs etc) • A typical cassette 
s quence cartridge would inc. ad r stricti n sites for 8 - 
11 diff r nt nzym s (i. • Eco Rl, Sacl, Sma 1, Ava 1, Bam 

35 HI, Xba 1, Hinc II, Acc 1, Sal 1, Pst 1, Hind III, to.). 



5 



10 



15 



20 



WO 92/07080 



t 

PCT/US91/07690 



-34- 



Th availability of th se cassette cartridges are known to 
those skilled in the art. 

The bacterial plasmid sequences may be derived f r m 
5 any one of the many different vectors that are 

commercially available and known to those skilled in the 
art of recombinant DNA technology. For the purpose of 
this invention, pUC, pKS, pBR322 and pT7/T3al8 are used as 
a matter of preference, however, other vectors would be 
10 equally effective. For example, if pBR322 sequences are 

introduced into the cloning or expression vector, the 
resulting recombinant can then be shuttled back and forth 
between E. coli and mammalian cells. 

15 The construction of an optimal eukaryotic expression 

vector that can accommodate 40 Jcb - 400 kb pieces of DNA 
will also contain, in addition to the elements described 
for the cloning vector, a DNA region containing a 
promoter, a polyadenylation and splice site necessary for 

20 the expression of the desired gene. 

There are at least two approaches for constructing an 
optimal eukaryotic cloning vector that can accommodate 40 
- 400 Jcb pieces of DNA. 

25 

1. The first and more simpler approach is to begin 
with a readily available cloning plasmid vector 
capable of propagation in bacteria. There ar 
many different vectors known to those skilled in 

30 the art that would work efficienty. Several 

different components and features can easily be 
ligated into this bacterial plasmid vector. 
These add d features are discussed below. Once 
completed, the vector will not only have th 

35 core structur (to confer th ability to 

replicat DNA and to be maintained 
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extrachromosomally) but will also have the added 
features to optimize the vector for propagation 
in bacteria and for identification of its 
presence after transfection into a mammalian 
cell recipient. 



2. The second approach involves custom designing 
andcreating the optimum cloning vector by 
ligating all the desired features and components 
10 (including the core structure) together to 

generate the vector of choice. 

Construction of a Mammalian Artifi cial Chromosome 
The episomally maintained and replicated vector pDFE 
15 ori + mat + is introduced into cells and persist as coval nt 

circular extrachromosomal molecules. In this form the 
episomes accumulate to produce multiple copies in each 
cell and accordingly, also overproduce mRNA and its 
protein product. While this is desirable for producing 
20 amplified genes and gene products, the introduction of 

cloned genes into cells for use in gene therapy requires 
the control of gene copy number and attendant gene 
expression. Such control is introduced into the DiFi 
episome vector by introducing DMA sequences that stabilize 
25 artificial chromosomes containing linear double strand d 
DMA (DNA encoding a telomere) . Such sequences occur at 
the termini of natural chromosomes; in human chromosom s 
5 9 — AGGGTT-3 ' is tandemly repeated to the extent of 10 of 
15 kb at every telomere (Blackburn, Science . 249:489, 
30 1990) . This tandemly repeated sequence is ligated to each 

end of a linearized cloning and expression vector to 
stabilize the termini. The addition of telomere sequences 
specific for other speci s provid s for the stabilization 
t artificial chromosomes when introduced into th se 
35 speci s. 
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Centr mere sequ nces are known to identify regions 
within chromosomes where kinetichores are organized and 
mitotic spindles are attached to the chromosomes, thus 
ensuring for the segregation of chromosomes during 
5 mitosis. DNA sequences that serve as centromeres are 
introduced into an internal region of the linearized 
cloning and expression vector which contain telomeres 
resulting in an artificial chromosome. This synthetic 
chromosome contains required regulatory and stabilizing 
10 DNA sequences that normally occur in natural chromosomes. 

Specific genetic function is conferred on this 
synthetic chromosome by ligating a gene of interest into 
its multiple cloning site. For example, the gene or cDNA 

15 derivative of the gene that is defective in Duchenne , s 

muscular dystrophy or myotonic dystrophy, or one of a 
number of other diseases associated with muscle 
dysfunction may be cloned into the artificial chromosome. 
The artificial chromosome is then introduced into cells or 

20 tissues or animals by methods appropriate for the target. 
The transfected chromosome is established as an integral 
component of the recipient cells where it is stably 
maintained and expressed. Recipient cells, tissues or 
animals that were initially dysfunctional because of a 

25 genetic defect they possessed are cured and become normal 
because of the expression and synthesis of the normal gen 
product introduced in the artificial chromosome. 

2L. Evaluation of Different Str ategies for 
30 Transfectina Cloning or Expression Vectors into 

MamaaXiaa 

Once the optimal cloning and expression vector is 
constructed, several different strategi s for transf cting 
the vectors will be studi d. Exampl s of pot ntial 
35 meth ds includes: (l) encapsulation of insert-c ntaining 
vect rs in liposomes of appropriate composition to enhance 
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entry into target cells, and (2) electroporation f vector 
Into mitotic cell recipients to enhance its inclusion 
within the nucleus as cells progress into 61 phase of th 
cell cycle, and (3) injection of DNA-encoated particles 
5 into cells by employing a Biolistic Particle Delivery 
System (DuPont) . This procedure essentially shoots 
DNA-coated bullets into cells or tissues. 

Biosynthetic Production of Proteins in dslls 
10 Transfected With Cloning and Expression Vectors 

Containing Isolated Genes or Functional 
Derivatives 

Medically important proteins are produced in 
mammalian cells that have been transfected with the vector 
15 containing the gene encoding the protein. Since the 

gene-containing vector accumulates in the transfected 
cells, the amount of protein produced increases as more 
vector copies accumulate. The following example 
illustrates an efficient system for protein production. 
20 To produce the product of the gene that is deficient in 
patients with myotonic dystrophy, the vector containing 
the normal gene is electroporated into a normal primary 
human fibroblast cell line HSF56, adapted for growth in 
suspension culture in serum free medium. The accumulation 
25 of the cloning vector in each cell is accelerated by 

growing the cells in the drug appropriate for the drug 
resistance gene contained in the vector. As the gene copy 
number accumulates the amount of protein increases to be 
recovered from the culture medium or from the cells after 
30 maximal growth is achieved. The medical condition of 
patients with myotonic dystrophy may be improved by 
treatment with the protein that is provided by this 
cloningexpr ssion syst m. Modification of the vector to 
include other genes and selection f targ t c lis and 
35 appropriate cultur conditions provid s ndl ss p ssible 
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systems for the production and isolation of mammalian 
proteins. 

The foregoing description has been directed to 
5 particular embodiments of the invention in accordance with 
the requirements of the Patent Statutes for the purposes 
of illustration and explanation. It will be apparent, 
however, to those skilled in this art, that many 
modifications and changes in the apparatus and procedure 
10 set forth will be possible without departing from the 

scope and spirit of the invention. It is intended that 
the following claims be interpreted to embrace all such 
modifications and changes. 



15 
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CIAIMS 

1. A composition of matter comprising a 
substantially purified non-viral episome of human origin 
capable of stable extrachromosomal maintenance and of 
autonomous replication in a compatible mammalian cell 
line. 



10 2. A substantially purified episomal DNA segment 

containing an origin of replication. 



3. A substantially purified episomal DNA segm nt 
15 containing a DNA sequence which confers upon a vector 

i. eluding said segment the ability to be stably maintained 
extrachromosomally in a cell transfected with said vector* 



20 4. A substantially purified episomal DNA segment 

containing an origin of replication and a DNA sequence 
which confers upon a vector including said segment the 
ability to be stably maintained extrachromosomally in a 
cell transfected with said vector. 



25 



30 



5. The substantially purified episomal DNA segment 
of claim 2, 3, or 4 wherein the episomal DNA segment is 
from an episome isolated from DiFi colorectal cell lin . 



6. A cloning vector comprising the following 
components operatively spaced with respect to a desired 
gene: 



35 



* 
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a) a DNA segm nt derived from a non-viral episome, said 
segment containing an origin for DNA replication; 

b) a DNA segment derived from a non-viral episome, said 
5 segment containing a DNA sequence which confers upon 

said vector the ability to be stably maintained 
extrachromosomally in a cell transfected with said 
vector; 

10 c) a DNA segment containing a multiple cloning site; 

d) a DNA selectable marker segment conferring upon a 
cell transfected with said vector the ability to 
survive in the presence of a selected compound or 

15 selected group of compounds; and 

e) a DNA segment encoding bacterial components necessary 
for propagation of said vector in bacteria. 



20 



25 



7. The cloning vector of claim 6 wherein said 
compound is selected from the group consisting of 6418 and 
hygromycin B. 



8. The cloning vector of claim 6 further including 
a DNA sequence encoding a desired protein. 



30 9* The cloning vector of claim 6 wherein the 

segment containing the origin for DNA replication is from 
an episome isolated from DiFi colorectal cell line. 



35 



10. The . cloning vector of claim 6 wher in the 
segment containing a DNA sequence which confers upon said 
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v ctor th ability to be stably maintain d 
extrachromosomally in a cell transfected with said vect r 
is from an episome isolated from DiFi colorectal cell 
line. 



11 • The cloning vector of claim 6 wherein the 
chromosomal DNA of said transfected cell contains a 
mutation in an enzyme, said mutation rendering said 

10 selected compound or selected group of compounds toxic to 
said cell when said selectable marker segment is not 
present in said cell and wherein said selectable marjc r 
segment contains DNA encoding an enzyme capable of 
functionally replacing said mutated enzyme so as to render 

15 said transfected cell resistant to said selected compound 
or selected group of compounds. 



12. The cloning vector of cl^im 11 wherein said 

20 enzyme is selected from the group consisting of: thymidine 
kinase, xanthine-guanine phosphor ibosyl transferase, 
adenine phosphor ibosyl transferase, adenosine deaminase and 
dihydrofolate reductase. 

25 

13. A cloning vector comprising the following 
components operatively spaced with respect to a desired 
gene: 

30 a) a DNA segment derived from a non-viral episome, 

said segment containing an origin for DNA 
replication and a DNA sequence which confers 
upon said vector the ability to be stably 
maintained extrachromosomally in a cell 

35 transfected with said vector; 
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b) a DNA segm nt containing a multiple cloning 
site; 

c) a DNA segment conferring upon a cell transfected 
5 with said vector the ability to survive in the 

presence of a selected compound or selected 
group of compounds; and 

d) a DNA segment encoding bacterial components 
10 necessary for propagation of said vector in 

bacteria. 



14. The cloning vector of claim 13 wherein said 
15 compound is selected from the group consisting of G418 and 

hygromycin B. 



15. The cloning vector of claim 13 further including 
20 a DNA sequence encoding a desired protein. 



16. The cloning vector of claim 13 wherein the DNA 
segment containing an origin for DNA replication and a DNA 

25 sequence which confers upon said vector the ability to be 

stably maintained extrachromosomally in a cell transfect d 
with said vector is from an episome isolated from DiFi 
colorectal cell line. 

30 

17 . The cloning vector of claim 13 wherein the 
chromosomal DNA of said transfected cell contains a 
mutation in an enzyme, said mutation rendering said 
selected compound or s lect d group of c mpounds toxic to 

35 said cell when said s lectabl mark r s gment is not 

present in said cell and wherein said s lectabl marker 
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s gment contains DNA encoding an enzyme capable of 
functionally replacing said mutated enzyme so as to rend r 
said transfected cell resistant to said selected comp und 
or selected group of compounds. 



18. The cloning vector of claim 17 wherein said 
enzyme is selected from the group consisting of: thymidine 
kinase, xanthine-guanine phosphoribosyl transferase, 
10 adenine phosphoribosyltransf erase, adenosine deaminas and 
dihydrofolate reductase. 



15 



25 



30 



19. An expression vector comprising the following 
components operatively spaced with respect to a desir d 
gene: 



a) a DNA segment derived from a non-viral episome, 
said segment containing an origin for DNA 

2 0 replication ; 

b) a DNA segment derived from a non-viral episome, 
said segment containing a DNA seguence which 
confers upon said vector the ability to be 
stably maintained extrachromosomally in a cell 
transfected with said vector; 

c) a DNA segment containing a multiple cloning 
site; 

d) a DNA segment conferring upon a cell transfected 
with said vector the ability to survive in the 
presence of a selected compound or selected 
group of compounds; 
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) a DNA segment encoding bact rial components 
necessary for propagation of said vector in 
bacteria; and 

f) a promoter, a polyadenylation site, and a splic 
site in spacial relation to allow the efficient 
expression of a structural gene upon insertion 
of said gene into said splice site. 



20. The expression vector of claim 19 wherein said 
compound is selected from the group consisting of 6418 and 
hygromycin B. 



21. The expression vector of claim 19 further 
including a DNA sequence encoding a desired gene. 



20 22. The expression vector of claim 19 wherein the 

bacterial components necessary for propagation of said 
vector in bacteria are derived from pBR322, puc, pT7/T3a- 
18 or pKS. 



15 



25 



30 



23. The expression vector of claim 19 wherein the 
segment containing the origin for DNA replication is from 
an episome isolated from DiFi colorectal cell line. 



24. The expression vector of claim 19 wherein the 
segment containing a DNA sequence which confers upon said 
vector the ability to be stably maintained 
extrachromosomally in a c 11 transfected with said vector 
35 is from an pisome isolat d from DiPi colorectal c 11 
lin . 
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25. The expression vector of claim 19 wherein th 
chromosomal DNA of said transfected cell contains a 
mutation in an enzyme, said mutation rendering said 
5 selected compound or selected group of compounds toxic to 

said cell when said selectable marker segment is not 
present in said cell and wherein said selectable marker 
segment contains DNA encoding an enzyme capable of 
functionally replacing said mutated enzyme so as to render 
10 said transfected cell resistant to said selected compound 

or selected group of compounds. 



26. The expression vector of claim 25 wherein said 
15 enzyme is selected from the group consisting of: thymidine 

kinase, xanthine-guanine phosphor ibosyl transferase, 
adenine phosphor ibosyltransf erase, adenosine deaminase and 
dihydrofolate reductase. 

20 

27. An expression vector comprising the following 
components operatively spaced with respect to a desired 
gene: 

25 a) a DNA segment derived from a non-viral episome, 

said segment containing an origin for DNA 
replication and a DNA sequence which confers 
upon said vector the ability to be stably 
maintained extrachromosomally in a cell 

30 transfected with said vector; 

b) a DNA segment containing a multiple cloning 
sit ; 



35 



c) 



a DNA segment conferring upon a cell transfected 
with said vector th ability to survive in the 
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presence of a s lected c mpound or s lected 
group of compounds; 

d) a DNA segment encoding bacterial components 
5 necessary for propagation of said vector in 

bacteria; and 

e) a promoter, a polyadenylation site, and a splice 
site in spacial relation to allow the efficient 

10 expression of a structural gene upon insertion 

of said gene into said splice site. 



28. The expression vector of claim 27 wherein said 
15 compound is selected from the group consisting of 6418 and 

hygromycin B. 



29. The expression vector of claim 27 further 
20 including a DNA sequence encoding a desired protein. 

30. The expression vector of claim 27 wherein tihe 
bacterial components necessary for propagation of said 

25 vector in bacteria are derived from pBR322, pUC, pT7/T3a- 

18 or pKS. 



31. The expression vector of claim 27 wherein the 
30 DNA segment containing an origin for DNA replication and a 
DNA sequence which confers upon said vector the ability to 
be stably maintained extrachromosomally in a cell 
transf cted with said vector is from an episome isolated 
from DiFi colorectal cell line. 



35 
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32. The expr ssion vector of claim 27 wh r in th 
chromosomal DNA of said transf ected cell contains a 
mutation in an enzyme, said mutation rendering said 
selected compound or selected group of compounds toxic t 
5 said cell when said selectable marker segment is not 

present in said cell and wherein said selectable marker 
segment contains DNA encoding an enzyme capable of 
functionally replacing said mutated enzyme so as to render 
said transfected cell resistant to said selected compound 
10 or selected group of compounds. 



33. The cloning vector of claim 32 wherein said 
enzyme is selected from the group consisting of: thymidin 
15 kinase, xanthine-guanine phosphoribosyl transferase, 

adenine phosphor ibosyltransf erase, adenosine deaminase and 
dihydrofolate reductase. 

20 34. The expression vector of claim 19 or 27 wherein 

the promoter is selected from the group consisting of: 
cytomegalovirus promoter, SV-40 promoter, Rous sarcoma 
virus promoter, thymidine kinase promoter, beta-act in 
promoter, metal lothionein promoter, and epidermal growth 

25 factor receptor gene promoter isolated from a OiFi 

episome. 

35. An artificial chromosome comprising: 
30 a DNA segment derived from a non-viral episome, said 

segment containing an origin for DNA replication, a DNA 
segment derived from a non-viral episome, said segment 
containing a DNA sequence which confers upon said vector 
the ability to be stably maintained xtrachromosomally in 
35 a cell transfected with said vector, a DNA s gment 
containing a multiple cloning sit , a DNA sel ctabl 
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mark r segment conferring upon a cell trans fected with 
said vector r the ability to survive in the presence of a 
selected compound or selected group of compounds, a DNA 
segment encoding bacterial components necessary for 
5 propagation of said vector in bacteria, a promoter, a 
polyadenylation site, a splice site, a DNA segment 
encoding a centromere and a DNA segment encoding a 
telomere . 

10 

36* The artificial chromosome of claim 35 wherein 
said compound is selected from the group consisting of 
6418 and hygromycin B. 

15 

37. The artificial chromosome of claim 35 further 
including a DNA sequence encoding a desired protein. 

20 38. The artificial chromosome of claim 35 wherein 

the chromosomal DNA of said transfected cell contains a 
mutation in an enzyme, said mutation rendering said 
selected compound or selected group of compounds toxic to 
said cell when said selectable marker segment is not 

25 present in said cell and wherein said selectable marker 
segment contains DNA encoding an enzyme capable of 
functionally replacing said mutated enzyme so as to render 
said transfected cell resistant to said selected compound 
or selected group of compounds. 

30 

39. The artificial chromosome of claim 38 wherein 
said enzyme is selected from the group consisting of: 
thymidine kinase, xanthin -guanine phosphorib syl 
35 transf rase, adenine phosphoribosyltransf rase, aden sine 
d aminase and dihydrofolat reductase. 
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